Large Margin Trees for Induction and Transduction
نویسندگان
چکیده
The problem of controlling the capacity of decision trees is considered for the case where the decision nodes implement linear threshold functions. In addition to the standard early stopping and pruning procedures, we implement a strategy based on the margins of the decision boundaries at the nodes. The approach is motivated by bounds on generalization error obtained in terms of the margins of the individual classifiers. Experimental results are given which demonstrate that considerable advantage can be derived from using the margin information. The same strategy is applied to the problem of transduction, where the positions of the testing points are revealed to the training algorithm. This information is used to generate an alternative training criterion motivated by transductive theory. In the transductive case, the results are not as encouraging, suggesting that little, if any, consistent advantage is culled from using the unlabelled data in the proposed fashion. This conclusion does not contradict theoretical results, but leaves open the theoretical and practical question of whether more effective use can be made of the additional information.
منابع مشابه
Univariate Decision Tree Induction using Maximum Margin Classification
In many pattern recognition applications, first decision trees are used due to their simplicity and easily interpretable nature. In this paper, we propose a new decision tree learning algorithm called univariate margin tree where, for each continuous attribute, the best split is found using convex optimization. Our simulation results on 47 data sets show that the novel margin tree classifier pe...
متن کاملLatent Structure Perceptron with Feature Induction for Unrestricted Coreference Resolution
We describe a machine learning system based on large margin structure perceptron for unrestricted coreference resolution that introduces two key modeling techniques: latent coreference trees and entropy guided feature induction. The proposed latent tree modeling turns the learning problem computationally feasible. Additionally, using an automatic feature induction method, we are able to efficie...
متن کاملتأثیر اجرای شیوه تکگزینی بر فراوانی و مشخصات درختان قطور (سالم، پوسیده و خشکهدار) در جنگل ناو اسالم در شمال ایران
Rotten and dead trees are the main component of forest ecosystems and play an important role in maintaining forest biodiversity. In this research frequency and characteristics of large diameter trees (normal, rotten, and dead trees) with diameter at breast height greater than 60 cm were studied in two compartments (selective logged and protected) in Asalem-Nav forest. Random systematic sampling...
متن کاملInvestigation and Determine of Ecological Characteristics of Sites of some old Broad-leaf and needle-leaf Trees in Zagros forests (Case study: Forests of Ilam Province)
. Introduction Old trees are important and key elements of forest sites and are of great value in terms of forest management, reforestation, silviculture and ecology. Although old trees constitute a small percentage of forest trees, they account for a large share of forest carbon reserve and play a vital role in carbon storage. Understanding the how geographical and site distribution of thes...
متن کاملThe Ramsey numbers of large trees versus wheels
For two given graphs G1 and G2, the Ramseynumber R(G1,G2) is the smallest integer n such that for anygraph G of order n, either $G$ contains G1 or the complementof G contains G2. Let Tn denote a tree of order n andWm a wheel of order m+1. To the best of our knowledge, only R(Tn,Wm) with small wheels are known.In this paper, we show that R(Tn,Wm)=3n-2 for odd m with n>756m^{10}.
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1999